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Abstract  We  present  a  small-footprint  search  protocol  de¬ 
signed  to  facilitate  any-type  queries  for  data  content  and 
services  in  large  population,  high-density  wireless  sensor 
networks.  Our  protocol,  termed  Trajectory-based  Selective 
Broadcast  Query  (TSBQ),  works  in  conjunction  with  time 
division  multiple  access-  or  schedule-based  medium  ac¬ 
cess  control  protocols  to  reduce  per-query  energy  expen¬ 
diture.  We  compare  the  performance  of  TSBQ  to  unicast- 
and  local  broadcast-based  search  algorithms  and  also  deter¬ 
mine  a  critical  node  density  based  on  the  energy  expended 
by  nodes  to  transmit  and  receive.  Minimal  energy  expen¬ 
diture  is  achieved  by  determining  the  optimal  number  of 
data/service  replicates  and  the  number  of  nodes  designated 
to  receive  each  query  transmission.  Numerical  results  indi¬ 
cate  that  TSBQ  significantly  reduces  the  total  energy  ex¬ 
penditure  of  a  network  as  compared  to  unicast  and  local 
broadcast-based  search  protocols. 
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1  Introduction 

Wireless  sensor  networks  (WSN)  are  formed  through  the 
cooperation  of  hundreds  or  thousands  of  small  sensing  de¬ 
vices.  These  devices,  called  nodes ,  are  linked  via  a  wireless 
transmission  medium  to  perform  tasks  in  a  distributed  man¬ 
ner.  Nodes  typically  have  limited  energy  stores,  transmission 
range,  local  storage  capacity,  and  computational  ability. 

It  is  envisioned  that  wireless  sensor  networks  will  soon 
be  composed  of  hundreds  of  thousands  or  even  millions  of 
nodes.  As  these  computing  devices  become  smaller  in  size, 
they  will  be  embedded  in  raw  materials  to  construct  build¬ 
ings,  bridges,  houses,  and  roads.  They  will  perform  func¬ 
tions  such  as  monitoring  the  integrity  and  security  of  struc¬ 
tures  and  territory  as  well  as  tracking  traffic  patterns,  ac¬ 
cidents,  and  weather.  In  an  emergency  situation,  such  net¬ 
works  will  alert  the  appropriate  personnel  autonomously,  re¬ 
ducing  response  time  and  saving  lives  and  assets.  As  sen¬ 
sor  networks  become  ubiquitous,  disparate  networks  will 
eventually  merge  due  to  proximity  and  necessity.  In  time, 
a  network  monitoring  the  structural  integrity  of  a  building 
might,  for  example,  communicate  with  the  network  moni¬ 
toring  nearby  roads  as  well  as  the  networks  embedded  in 
other  buildings. 

As  the  size  and  scale  of  wireless  sensor  networks  con¬ 
tinue  to  grow,  two  characteristics  will  be  critical  to  main¬ 
taining  their  viability.  First,  high  node  densities  (i.e.,  those 
for  which  each  node  has  a  large  number  of  one-hop  neigh¬ 
bors)  will  be  necessary  to  meet  an  increasing  demand  for 
high-precision  sensor  data  while  simultaneously  providing 
redundant  communication  paths  throughout  the  network. 
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High  node  density  also  results  in  increased  average  life¬ 
time  per  unit  density  of  the  network,  a  favorable  property 
in  networks  composed  of  large  numbers  of  low-cost,  unre¬ 
liable  nodes  [32].  Second,  small-footprint,  scalable,  energy- 
efficient  applications  will  remain  a  critical  enabling  technol¬ 
ogy.  Key  among  these  critical  applications  is  the  capability 
of  nodes  to  locate  data  and  services  within  the  network.  Due 
to  the  distributed  nature  of  data  collection  in  WSNs,  no  sin¬ 
gle  node  is  likely  to  have  all  the  information  necessary  to 
complete  a  particular  task.  However,  locating  information 
requires  nodes  to  expend  precious  energy  reserves  thereby 
reducing  node  and  network  lifetime.  Developing  an  energy- 
efficient  means  for  nodes  to  both  advertise  the  availability  of 
data  or  services  and  to  locate  these  items  within  the  network 
is  the  focus  of  this  paper. 

The  main  contributions  of  this  research  may  be  sum¬ 
marized  as  follows.  First,  we  develop  a  small-footprint 
search  protocol,  Trajectory-based  Selective-Broadcast  Query 
(TSBQ),  which  significantly  reduces  the  total  energy  used  to 
advertise  and  locate  data  and  services  within  a  network  as 
compared  to  unicast  and  local-broadcast  search  methods. 
Second,  we  provide  an  analytic  model  for  the  expected  to¬ 
tal  energy  expended  by  TSBQ  and  show  how  to  minimize 
the  expected  total  energy  expended  by  determining  simulta¬ 
neously  the  optimal  number  of  agent  replicas  and  the  num¬ 
ber  of  nodes  that  should  be  designated  as  receivers  for  each 
query  transmission.  Using  our  model,  we  predict  the  perfor¬ 
mance  variance  of  rumor  routing-based  search  protocols  and 
offer  a  means  to  minimize  this  variance.  Third,  by  means  of 
a  simulation  model,  we  evaluate  the  performance  of  TSBQ 
and  consequently,  we  propose  further  refinements  to  the  pro¬ 
tocol.  Finally,  we  elucidate  the  effect  of  network  boundaries 
and  incorporate  these  effects  into  the  mathematical  model. 

The  remainder  of  this  paper  is  organized  as  follows.  In 
Sect.  2,  we  discuss  related  work.  In  Sect.  3,  we  develop  and 
analyze  a  mathematical  model  for  the  expected  total  energy 
expenditure  of  the  TSBQ  protocol.  The  results  of  simula¬ 
tion  experiments  with  large,  high-density  networks  are  pre¬ 
sented  in  Sect.  4.  Based  on  the  results  of  these  experiments, 
we  propose  improvements  to  the  protocol  and  mathemati¬ 
cal  model.  Section  5  provides  conclusions  and  directions  for 
future  work. 


2  Related  work 

The  original  rumor  routing  protocol  [4]  and  several  of  its 
variants  [1,  2,  6,  27]  are  most  closely  related  to  our  proposed 
Trajectory-based  Selective  Broadcast  Query  (TSBQ)  search 
protocol.  In  rumor  routing  [4],  both  queries  and  resource  ad¬ 
vertisements  (called  “agents”)  travel  from  node  to  node  by 
means  of  a  modified  random  walk.  Nodes  in  receipt  of  an 
agent  are  capable  of  answering  related  queries  from  other 


nodes.  In  [6],  rumor  routing’s  dual  problems  of  spiraling 
agent/query  routes  and  ever-increasing  packet  size  (due  to 
the  need  to  record  previously-visited  nodes  to  prevent  back¬ 
ward  paths)  are  solved  by  forwarding  agents  and  queries  us¬ 
ing  straight-line  routing  (SLR).  REDMAN  [2]  is  similar  to 
SLR  in  that  agents  and  queries  are  forwarded  along  straight- 
line  trajectories.  However,  resource  replicas  are  stored  only 
at  every  kth  node  along  the  agent’s  path;  the  remaining  inter¬ 
mediate  nodes  store  a  pointer  to  the  nearest  available  replica. 
Zonal  Rumor  Routing  [1]  is  an  extension  of  rumor  rout¬ 
ing  that  partitions  the  network  into  artificial  zones  for  pur¬ 
poses  of  choosing  intermediate  nodes  for  agent/query  rout¬ 
ing.  Neighboring  nodes  assigned  to  unvisited  zones  are  fa¬ 
vored  when  choosing  an  agent  or  query’s  next  hop,  thus  im¬ 
proving  the  probability  of  a  successful  query.  With  respect  to 
our  research,  however,  we  have  noted  that  there  are  currently 
no  analytic  models  of  rumor  routing-based  search  protocols 
that  permit  determination  of  optimum  resource  replication 
levels  based  on  node  hardware  characteristics  and  resource 
popularity.  Moreover,  none  of  these  protocols  take  advan¬ 
tage  of  the  power  of  broadcast  transmissions,  nor  do  they  in¬ 
corporate  a  feedback-driven  caching  mechanism  to  improve 
latency  and  decrease  the  energy  expended  by  subsequent 
queries. 

Quorum-based  search  protocols  (see,  for  example, 
[12,  16,  25])  also  seek  to  facilitate  intersection  between 
queries  and  their  corresponding  agent  trajectories.  This  is 
typically  accomplished  by  forwarding  along  straight-line 
paths  in  each  of  the  four  cardinal  directions.  For  example, 
in  GCLP  [27]  both  agents  (called  “content  advertisements”) 
and  queries  are  propagated  along  straight-line  trajectories 
in  the  north-south  and  east-west  directions.  This  method 
guarantees  intersection  of  a  query  with  at  least  one  Content 
Location  Server  (i.e.,  a  node  aware  of  the  location  of  a  spe¬ 
cific  resource).  Quorum-based  schemes  can  also  achieve  a 
measure  of  energy  efficiency  by  aggregating  advertisements 
at  each  node  prior  to  transmission.  However,  most  quorum- 
based  schemes  require  nodes  to  maintain  sizeable  stores  of 
information  regarding  the  location  of  distant  nodes;  in  mo¬ 
bile  networks,  this  information  must  be  frequently  updated 
or  the  node  risks  returning  stale  information  in  response 
to  a  query.  Also,  quorum-based  search  protocols  treat  re¬ 
sources  with  equivalent  importance.  Both  popular  and  un¬ 
popular  items  consume  the  same  amount  of  network  storage 
capacity,  and  the  mean  energy  and  latency  required  to  lo¬ 
cate  both  popular  and  unpopular  items  are  the  same.  Our 
research  indicates  this  paradigm  forces  over-representation 
of  unpopular  items  within  the  network’s  aggregate  storage 
capacity  and  increases  the  total  energy  expended  for  popular 
item  queries. 

Rumor  routing  and  quorum-based  approaches  can  be  de¬ 
scribed  as  “geocentric”  because  the  dispersal  of  resource 
advertisements  and/or  replicates  is  based  on  network  topol¬ 
ogy  or  direction.  Such  approaches  differ  from  “data-centric” 
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search  algorithms  in  that  the  requesting  node  has  no  knowl¬ 
edge  of  the  location  of  the  desired  resource  when  it  is¬ 
sues  the  query.  As  an  alternative,  resources  in  data-centric 
networks  are  self-organized  to  facilitate  answering  queries. 
Data-centric  methods,  including  [20,  21,  23],  remove  uncer¬ 
tainty  regarding  the  location  of  resources  by  forwarding  data 
to  a  pre-defined  network  location,  node,  or  group  of  nodes 
based  on  specific  data  characteristics.  Queries  are  forwarded 
directly  to  the  node  or  nodes  responsible  for  caching  re¬ 
lated  data,  thus  decreasing  latency  and  energy  expenditure. 
However,  data-centric  algorithms  are  not  without  their  chal¬ 
lenges.  Most  importantly,  all  nodes  require  access  to  a  com¬ 
mon  hash  table  to  determine  where  to  send  their  resource 
advertisements  and  queries.  “Hotspotting,”  rapid  energy  de¬ 
pletion,  storage  capacity  overload,  and/or  congestion  can  oc¬ 
cur  if  a  single  node  or  location  becomes  the  repository  for 
a  large  amount  of  network  data  or  queries,  and  network¬ 
wide  updates  to  the  hash  table — when  needed — are  costly 
in  terms  of  energy  expenditure.  Scalability  can  also  be  prob¬ 
lematic  unless  a  load  balancing  method,  such  as  a  distributed 
hash  table,  is  used  (see,  for  example,  [14,  19,  22,  24,  33]). 
Finally,  because  nodes  are  subject  to  random  failure,  data 
redundancy  is  a  favorable  characteristic  to  prevent  data  loss. 
However,  data-centric  algorithms  tend  to  group  related  data 
in  close  proximity,  thus  increasing  the  severity  of  data  loss  in 
the  event  of  node  failure  or  network  partitioning.  To  achieve 
redundancy  in  data-centric  networks,  data  may  be  replicated 
at  nodes  located  in  close  proximity  to  the  hashed  location 
[23]  or  dispersed  throughout  the  network  in  a  geocentric- 
type  approach.  Although  data-centric  approaches  can  be  ef¬ 
fective,  we  favor  the  simplicity,  flexibility,  redundancy,  and 
data  dispersion  achieved  by  geocentric  approaches  in  net¬ 
works  composed  of  large  numbers  of  unreliable  nodes. 

There  also  exist  ongoing  and  relevant  efforts  to  develop 
efficient  replication  and  search  strategies  in  unstructured 
peer-to-peer  networks  [3,  7-9,  15].  However,  these  efforts 
are  primarily  focused  on  reducing  query  latency  instead 
of  increasing  energy  efficiency  because  the  computers  in 
those  types  of  networks  are  less  constrained  by  available  en¬ 
ergy,  local  storage,  and  computational  capability.  The  means 
by  which  to  evaluate  tradeoffs  between  important  network 
parameters — including  the  number  of  agent  replicas  stored 
in  the  network,  total  network  storage  capacity,  hardware 
power  requirements,  and  node  density — has  received  little 
attention  in  the  open  literature.  This  research  closes  that  gap 
by  providing  a  means  to  evaluate  the  effects  of  these  parame¬ 
ters  on  overall  energy  savings,  effective  total  network  stor¬ 
age  capacity,  query  response  variance,  and  query  latency. 

Although  TSBQ  is  inspired  by  traditional  rumor  routing, 
the  following  characteristics  make  it  unique: 

•  TSBQ  is  the  only  WSN  search  protocol  to  minimize  the 
total  expected  energy  expenditure  of  the  network  by  an¬ 
alytically  determining  the  optimum  number  of  resource 


replicates  created  by  each  agent.  Additionally,  TSBQ 
leverages  the  broadcast  nature  of  wireless  transmissions 
to  query  multiple  nodes  per  transmission,  thereby  reduc¬ 
ing  total  energy  expenditure. 

•  TSBQ  specifically  accounts  for  resource  popularity  as 
well  as  the  energy  expended  by  nodes  both  to  listen  and 
to  receive  when  determining  the  appropriate  number  of 
receivers  and  the  number  of  nodes  informed  via  agents. 
Additionally,  TSBQ  accounts  not  only  for  the  energy  ex¬ 
pended  to  inform  the  network  via  an  agent  and  locate  the 
desired  information  via  a  query  but  also  for  the  energy 
expended  to  return  the  response  to  the  originating  node. 
Achieving  maximum  energy  savings  requires  optimizing 
each  of  these  sources  of  energy  expenditure  simultane¬ 
ously. 

•  Nodes  need  only  maintain  one-hop  neighbor  information 
to  eliminate  redundant  node  querying.  Although  a  node 
may  receive  a  reissued  query  more  than  once  (cf.,  Sect.  4), 
this  can  be  eliminated  by  permitting  nodes  to  ignore  the 
PQN’s  broadcast  during  the  next  transmission  period  if 
they  received  the  query  from  the  QN  in  a  previous  trans¬ 
mission  slot. 

•  TSBQ  reduces  network  congestion  by  limiting  retrans¬ 
missions  of  the  query  packet  to  a  single  query  node  (QN), 
thus  avoiding  the  inherent  difficulties  and  inefficiencies 
associated  with  network  flooding. 

•  TSBQ  includes  a  feedback-driven  caching  mechanism 
to  reduce  search  latency  for  popular  data/services.  This 
mechanism  requires  negligible  additional  energy  expen¬ 
diture  by  the  network. 


3  Protocol  description 

It  is  well  known  that  nodes  can  conserve  energy  resources 
by  turning  off  transmitting  and  receiving  hardware  when 
not  in  use  [13,  18,  21,  28].  Several  medium  access  con¬ 
trol  (MAC)  protocols  such  as  S-MAC  [31],  D-MAC  [13], 
T-MAC  [28],  and  TRAMA  [18]  achieve  energy  savings  in 
this  manner.  TSBQ  takes  advantage  of  node  hardware  char¬ 
acteristics  and  the  energy  savings  of  time  division  multiple 
access  (TDMA)-based  MAC  protocols  to  determine  the  ap¬ 
propriate  advertising  and  query  strategy  for  the  network.  Al¬ 
though  all  nodes  must  participate  in  the  MAC’S  contention 
period  to  coordinate  transmission  and  reception  schedules, 
nodes  not  designated  to  transmit  or  receive  during  a  given 
transmission  period  are  permitted  to  enter  a  low-power  sleep 
mode.  We  then  seek  to  minimize  the  total  energy  expended 
by  simultaneously  determining  the  appropriate  number  of 
receivers  designated  by  the  MAC  during  each  transmission 
period  and  the  optimum  proportion  of  resource  replicates. 
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3.1  TSBQ  overview 

When  discussing  the  means  to  propagate  and  locate  infor¬ 
mation  within  a  network,  we  adopt  and  expand  much  of  the 
terminology  of  Braginsky  and  Estrin  [4].  Agents  are  packets 
transmitted  by  witness  nodes  to  advertise  the  availability  of 
specific  services  or  data.  Informed  nodes  have  received  an 
agent  transmission  and  stored  the  agent’s  content  in  a  local 
event  table.  A  node  seeking  data  or  a  particular  service  is 
the  origin  query  node  (OQN),  and  nodes  that  relay  query 
packets  on  behalf  of  the  OQN  are  query  nodes  (QN).  OQNs 
and  QNs  transmit  queries ,  packets  that  “roam”  the  network 
in  search  of  specific  services  or  data.  Receiving  nodes  (RN) 
adjust  their  sleep  cycles  to  accommodate  the  transmission 
schedules  of  neighboring  OQN/QNs  when  designated  by  the 
OQN/QN  to  receive  a  query  transmission.  When  a  query  is 
received  by  an  informed  node,  the  node  generates  a  response 
that  is  returned  to  the  OQN.  The  response  may  contain  the 
specific  data  requested  by  the  end-user  or  simply  provide  the 
location  of  the  desired  data  or  service. 

Two  basic  principles  motivate  the  development  of  TSBQ. 
First,  we  seek  a  balance  between  the  energy  expended  to  in¬ 
form  the  network  of  an  event  or  service  via  an  agent  and  the 
energy  required  to  locate  an  informed  node  via  a  query.  If 
too  few  nodes  are  informed,  less  energy  is  used  to  transmit 
agents  and  the  network  storage  burden  is  decreased.  How¬ 
ever,  a  query  will  likely  expend  additional  energy  to  lo¬ 
cate  an  informed  node  thereby  negating  any  potential  energy 
savings.  Conversely,  if  too  many  nodes  are  informed,  the 
amount  of  energy  expended  for  each  query  is  reduced,  but 
the  energy  required  to  propagate  each  agent  is  increased  and 
a  larger  portion  of  the  network’s  aggregate  storage  capacity 
is  consumed.  Second,  when  querying  neighboring  nodes,  we 
must  balance  the  number  of  nodes  that  receive  each  query 
transmission  and  the  energy  expended  by  these  nodes  to  re¬ 
ceive  the  query.  If  too  few  nodes  receive  the  query,  addi¬ 
tional  transmissions  may  be  required  to  locate  an  informed 
node.  By  contrast,  if  too  many  nodes  receive  the  query,  an 
informed  node  may  be  located  with  lower  latency,  but  the 
uninformed  receiving  nodes  still  pay  a  cost  for  receiving  the 
query  packet. 

The  TSBQ  search  protocol  consists  of  the  following 
steps: 

1.  A  node  witnesses  an  event  and  generates  an  agent  to  in¬ 
form  an  additional  (a A  —  1)  nodes,  where  A  is  the  num¬ 
ber  of  nodes  in  the  network.  To  ensure  the  value  (a  A  —  1) 
is  integral,  a  e  {1/A,  2/A, . . . ,  (A  —  1)/A}. 

2.  An  OQN  generates  a  query  and  chooses  a  random  di¬ 
rection  (trajectory)  for  routing.  Based  on  this  trajectory, 
the  OQN  chooses  the  next  potential  query  node  (PQN) 
from  among  its  one-hop  neighbors  using  the  Most  For¬ 
ward  within  Range  (MFR)  criterion  (Fig.  1)  [26]. 
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Fig.  2  RN  selection  region  (simplified  transmission  model) 


3.  The  OQN/QN  randomly  selects  (<$'  —  1)  RNs  from 
among  its  neighbors  that  are  closer  to  itself  than  the 
PQN  (Fig.  2),  where  8f  is  a  positive  integer  no  greater 
than  the  cardinality  of  the  node’s  neighbor  set,  8.  (The 
means  to  determine  the  optimum  value  of  8'  is  discussed 
in  Sect.  3.) 

4.  Transmission/reception  coordination  between  the 
OQN/QN  and  RNs  is  achieved  via  a  TDM  A-  or  schedule- 
based  MAC  protocol  during  the  contention  period.  The 
OQN/QN  sets  the  transmission-reception  schedule  for  its 
neighbors  and  designates  the  RNs.  Nodes  not  designated 
as  a  QN,  PQN,  or  RN  enter  sleep  mode  to  conserve  en¬ 
ergy  during  the  appropriate  transmission  period(s). 

5.  The  OQN/QN  broadcasts  the  query  to  the  PQN  and  the 
designated  RNs  (a  total  of  8'  receivers  per  query  trans¬ 
mission). 

6.  If  no  response  is  received  from  the  PQN  or  RNs  (i.e.,  the 
query  fails  to  locate  an  informed  node),  then  the  PQN  be¬ 
comes  the  next  QN.  The  new  QN  chooses  a  PQN  using 
MFR  along  the  designated  trajectory.  The  process  returns 
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Fig.  3  Graphical  depiction  of  the  TSBQ  protocol 


to  Step  3  and  repeats  until  the  query  is  successful  or  ter¬ 
minated. 

7.  If  at  least  one  PQN  or  RN  is  informed,  the  node  transmits 
the  desired  information  to  the  QN.  The  response  is  then 
returned  to  the  OQN  via  MFR  routing  along  the  trajec¬ 
tory  defined  by  the  positions  of  the  QN  and  OQN.  The 
query  is  terminated  by  the  PQN  once  it  overhears  the  re¬ 
sponse  transmitted  by  the  QN. 

8.  A  feedback-driven  caching  mechanism  may  be  incorpo¬ 
rated  to  enable  intermediate  nodes  along  the  route  from 
the  informed  node  to  the  OQN  to  add  the  information  in 
the  response  to  their  own  event  tables.  This  mechanism 
is  discussed  in  Sect.  4. 

The  partial  network  diagram  in  Fig.  3  is  a  graphical  de¬ 
piction  of  the  TSBQ  protocol.  The  black  arrow  is  the  OQN’s 
randomly-chosen  query  trajectory,  the  black  circles  are  the 
sequence  of  PQN/QN  nodes  responsible  for  transmitting  the 
query  at  each  hop,  and  the  gray  circles  designate  the  RNs 
randomly  polled  by  a  QN  to  determine  if  they  have  a  cor¬ 
responding  agent.  The  dashed  arrow  represents  the  trajec¬ 
tory  of  the  desired  agent,  and  an  “X”  indicates  a  node  is  in¬ 
formed.  Nodes  C4  and  D3  in  Fig.  3  have  received  and  stored 
a  copy  of  the  agent  sought  by  the  OQN.  In  this  example, 
each  node  has  approximately  5  =  18  one-hop  neighbors,  and 
8'  =  8.  The  means  to  analytically  determine  8 '  is  discussed 
in  Sect.  3.3. 

When  a  node  needs  a  non-local  resource  yet  has  no 
knowledge  of  the  resource’s  location,  the  node  designates 
itself  as  the  OQN  and  randomly  picks  a  query  trajectory. 
Based  on  this  query  trajectory,  the  OQN  selects  the  PQN 
(node  QNI  in  Fig.  3)  and  randomly  chooses  (8'  —  1)  =  7 
neighbors  (i.e.,  RNs)  from  among  those  nodes  closer  to  itself 
than  the  PQN.  After  coordinating  with  its  neighbors  during 
the  MAC  contention  period,  the  OQN  transmits  the  query 


to  the  PQN  and  the  RNs.  The  OQN’s  remaining  neighbor 
nodes  are  permitted  to  sleep  during  this  transmission  period. 
If  neither  the  PQN  nor  the  seven  RNs  polled  by  the  OQN  can 
answer  the  query,  the  PQN  will  query  a  subset  of  its  neigh¬ 
bors  on  behalf  of  the  OQN.  Although  not  depicted  in  Fig.  3, 
the  OQN’s  query  is  unsuccessful;  therefore,  node  QNI  must 
forward  the  query. 

Based  on  the  query  trajectory  chosen  by  the  OQN,  node 
QN  1  identifies  node  QN2  as  the  PQN  and  randomly  selects 
nodes  A1-A7  as  RNs.  Since  neither  QN2  nor  A1-A7  are 
informed,  QNl’s  query  fails,  and  QN2  assumes  responsi¬ 
bility  for  the  next  query  transmission.  QN2  chooses  a  PQN 
(QN3)  based  on  the  specified  query  trajectory  and  selects 
seven  RNs  (B1-B7).  Since  none  of  these  nodes  hold  a  copy 
of  the  desired  agent,  QN2’s  query  also  fails. 

Once  QN3  recognizes  QN2’s  query  has  failed,  it  iden¬ 
tifies  the  PQN  (QN4)  and  chooses  seven  RNs  (C1-C7). 
Upon  polling  these  nodes,  node  C4  responds  with  the  de¬ 
sired  information.  QN3  uses  this  information  to  generate 
a  response,  determines  the  appropriate  response  trajectory, 
and  returns  the  response  to  the  OQN.  When  QN4  overhears 
the  response  transmitted  by  QN3,  it  terminates  the  query. 

During  each  query  transmission,  it  is  possible  that  an  in¬ 
formed  node  is  a  neighbor  of  the  QN  but  not  located  because 
the  node  was  not  chosen  as  a  PQN  or  RN.  If  this  occurs, 
the  result  is  increased  delay  in  returning  a  response  to  the 
OQN  and  additional  transmissions.  Eliminating  this  possi¬ 
bility  can  only  be  achieved  by  transmitting  the  query  to  all 
neighboring  nodes.  However,  in  Sect.  3.4,  we  show  the  total 
expected  energy  expended  by  the  network  to  answer  a  query 
is  minimized  by  choosing  a  subset  of  a  node’s  neighbors  as 
receivers  when  the  node  density  exceeds  a  specific  thresh¬ 
old. 
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3.2  Analytic  model  of  TSBQ  energy  expenditure 

Three  primary  sources  of  network  energy  expenditure  are 
required  to  generate  a  successful  response  to  a  query:  agent 
transmission/reception,  query  transmission/reception,  and 
response  transmission/reception.  Achieving  the  minimum 
energy  expenditure  per  successful  query  requires  balancing 
these  elements.  We  discuss  each  individually  in  the  follow¬ 
ing  subsections. 

Although  the  MAC  protocol  is  also  a  source  of  energy 
expenditure,  evaluation  of  these  protocols  within  the  con¬ 
text  of  our  model  is  beyond  the  scope  of  this  paper;  we 
assume  the  energy  expended  by  the  MAC  protocol  per 
successful  transmission  is  approximately  constant  for  net¬ 
works  of  a  given  size  and  density.  Our  primary  performance 
metric — the  sum  total  energy  required  for  agent  transmis¬ 
sion/reception,  query  transmission/reception,  and  response 
transmission/reception — measures  the  total  amount  of  en¬ 
ergy  expended  by  nodes  during  contention-free  transmission 
periods. 

3.2.1  Agent  transmission/reception 

Traditional  rumor  routing  assumes  each  node  within  range 
of  an  agent  transmission  receives  the  agent  and  adds  the 
event  to  its  local  event  table.  This  results  in  a  “thick  line”  of 
informed  nodes  in  the  network  [4] .  However,  in  high-density 
networks,  this  approach  has  two  disadvantages:  first,  a  large 
percentage  of  the  total  network  storage  capacity  is  consumed 
by  these  agents.  Event  tables  of  nodes  located  near  active  ar¬ 
eas  of  the  network  will  likely  reach  their  capacities  quickly, 
requiring  a  replacement  strategy  for  event  table  entries — an 
undesirable  alternative.  Second,  unless  the  agent  time-to- 
live  (TTL)  value  is  high,  an  agent  may  not  be  transmitted 
to  distant  regions  of  the  network.  This  means  large  portions 
of  the  network  have  no  informed  nodes  (i.e.,  a  low  spatial 
dispersion  of  informed  nodes).  As  a  consequence,  networks 
using  traditional  rumor  routing  techniques  may  be  unable  to 
locate  an  informed  node  without  large  energy  expenditure. 

To  increase  the  spatial  dispersion  of  informed  nodes 
while  simultaneously  minimizing  the  number  of  transmis¬ 
sions,  we  propose  that  agents  be  forwarded  along  straight- 
line  trajectories  in  a  manner  similar  to  the  techniques  used 
in  [2,  17,  27].  Additionally,  to  minimize  local  storage  re¬ 
quirements,  each  agent  transmission  is  intended  for  exactly 
one  receiving  node  (i.e.,  agents  are  transmitted  in  a  unicast 
fashion).  Coordination  between  transmitting  and  receiving 
nodes  is  achieved  via  a  TDMA-  or  schedule-based  medium 
access  control  protocol,  such  as  T-MAC,  during  the  MAC 
protocol’s  contention  period.  During  the  transmission  pe¬ 
riod,  all  nodes  within  range  of  the  agent  transmission  not 
designated  as  receivers  deactivate  their  receiving  hardware 
to  conserve  energy.  The  intended  receiving  node  is  chosen 
using  MFR  to  eliminate  routing  loops  [26]. 


In  the  event  a  node  cannot  forward  an  agent  along  the  de¬ 
sired  trajectory  (e.g.,  due  to  encountering  a  network  bound¬ 
ary),  the  node  will  randomly  choose  a  new  forwarding  tra¬ 
jectory  for  the  agent.  (Alternatively,  if  the  agent  cannot  be 
forwarded  due  to  a  void  or  obstacle  within  the  network,  a 
face  routing  scheme — such  as  Greedy  Perimeter  Stateless 
Routing  [11] — can  be  used  to  circumvent  this  region  until 
the  desired  trajectory  can  be  resumed.  However,  in  the  de¬ 
sign  space  of  large-scale,  high-density  networks  using  MFR, 
the  probability  of  encountering  a  void  is  small  [30].  There¬ 
fore,  this  occurrence  is  not  included  in  the  development  of 
the  mathematical  model.)  Each  agent  is  forwarded  to  ex¬ 
actly  (aN  —  1)  unique  nodes,  thus  ensuring  there  are  aN 
informed  nodes. 

Once  a  node  receives  an  agent,  the  node  makes  an  en¬ 
try  in  its  event  table  that  includes  the  type  of  service/data 
advertised,  the  location  of  the  witness  node,  and  a  copy  of 
the  data  (if  available).  Although  any  node  that  overhears  an 
agent  transmission  may  add  the  agent  to  its  event  table,  we 
advocate  the  unicast  transmission  of  agents  between  nodes 
and  the  use  of  MFR  to  select  receivers  as  a  means  to  promote 
the  maximum  physical  distance  between  identical  event  ta¬ 
ble  entries.  This  reduces  the  probability  that  large  numbers 
of  informed  nodes  are  found  only  within  limited  portions  of 
the  network. 

If  we  let  A  denote  the  total  energy  used  to  propagate  each 
agent,  then  for  large  networks  such  that  a  1 ,  the  expected 
total  energy  used  to  propagate  each  agent  is 

E[A]  =  (Exmt  +  Ercv)  •  (aN  -  1),  (3.1) 

where  Exmt  is  the  energy  expended  by  a  node  to  transmit  a 
packet,  and  Ercv  is  the  energy  expended  to  receive  a  packet. 

3.2.2  Query  transmission/reception 

When  a  node  needs  access  to  services  or  data  but  has  no 
corresponding  entry  in  its  event  table,  the  node  generates 
a  query.  Because  nodes  may  selectively  activate  and  deac¬ 
tivate  their  receiving  hardware,  the  node’s  query  transmis¬ 
sion  may  be  received  by  one,  some,  or  all  of  its  one-hop 
neighbors  simultaneously.  If  we  momentarily  assume  that 
informed  nodes  are  spatially  uniformly  distributed  through¬ 
out  the  network  and  disregard  the  effect  of  network  bound¬ 
aries  (assumptions  we  will  revisit  in  Sect.  4),  the  number 
of  informed  nodes  that  are  also  neighbors  of  each  QN  is  a 
binomial  random  variable. 

Fet  Y  be  the  number  of  informed  nodes  within  one-hop 
distance  of  the  QN.  If  a  QN  has  8  neighbors  and  a  cor¬ 
responding  query  is  transmitted  to  8 '  of  these  neighbors, 
0  <  8'  <  8,  then  the  probability  of  failing  to  find  an  informed 
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node  is  obtained  by 


Pr{T  =  0}  = 


(3.2) 


and  the  probability  of  finding  at  least  one  informed  node  is 


Pr{T  >  0}  =  1  — 


aN 
N-  1 


8' 


(3.3) 


Note  that  we  assume  a  node  does  not  generate  a  query  for  a 
particular  service  or  data  if  it  is  already  informed.  As  a  con¬ 
sequence,  the  probability  that  an  uninformed  node’s  neigh¬ 
bor  possesses  the  data  of  interest  is  slightly  greater  than  a. 

In  TSBQ,  queries  are  forwarded  along  straight-line  tra¬ 
jectories  in  a  manner  similar  to  that  used  for  agents.  How¬ 
ever,  in  contrast  to  agent  transmissions,  queries  are  broad¬ 
cast  to  a  subset  of  each  node’s  neighbors.  Nodes  that  have 
not  been  designated  to  receive  a  particular  query  transmis¬ 
sion  turn  off  their  receivers  to  conserve  energy.  The  use 
of  straight-line  routing  trajectories  increases  the  probabil¬ 
ity  that  a  subset  of  the  QN’s  neighbors  have  not  yet  received 
the  current  query  compared  to  random  walk  methods.  There¬ 
fore,  the  probability  of  finding  an  informed  node  increases 
with  each  hop  of  the  query  along  its  assigned  trajectory.  Let 
Zj  be  a  Bernoulli  random  variable  denoting  success  or  fail¬ 
ure  of  the  jth  query  hop  (transmission)  such  that  Zj  =  0 
when  the  jth  query  hop  fails  to  locate  an  informed  node  and 
Zj  =  1  otherwise.  If  a  query  is  broadcast  to  a  unique  set  of 
8 '  receivers  at  each  hop  in  its  path,  the  probability  that  the 
jth  query  transmission  fails  to  locate  an  informed  node  is 

/  aN  \8' 

Pr {Z;  =  0}  =  1 - ,  j  >  1.  (3.4) 

1  J  J  V  N  -  1  -  (j  -  1)8 ' 


Clearly,  sensor  networks  are  comprised  of  a  finite  num¬ 
ber  of  nodes.  Assuming  a  query  can  be  propagated  without 
encountering  a  network  boundary,  the  maximum  number  of 
query  transmissions  that  can  be  made  to  unique  neighboring 
nodes  before  locating  at  least  one  informed  node  is 


k  := 


N(l  —  a)  -  1 
8' 


+  1, 


ae{l/N,2/N,...,(N  -1)/N}. 


(3.6) 


Equation  (3.6)  assumes  that  at  least  one  node  in  the  network 
has  not  received  a  copy  of  the  agent;  otherwise,  there  would 
be  no  need  for  a  node  to  generate  a  query.  Let  Xa^  denote 
the  random  number  of  transmissions  required  to  find  an  in¬ 
formed  node  for  fixed  values  of  a  and  8'.  Then  the  probabil¬ 
ity  of  requiring  j  query  transmissions  is 


Pr  {Xa,8'  =  j} 


1-1- 


aN 
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1  —  (  max 
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(3.7) 


N  —  1  —  (/  —  1)8' 
and  the  expected  value  of  Xajr  is 
k 

E[Xa,S,]  =  J2j  -MXa8'  =  j}- 

j= 1 


,  2  <  j  <  k 


(3.8) 


Let  Q  be  the  energy  expended  by  the  network  to  locate 
an  informed  node.  The  use  of  straight-line  trajectories  for 
forwarding  queries — and  assuming  no  redundant  polling  of 
nodes — implies  that  the  expected  energy  used  to  forward  a 
query  can  be  derived  from  (3.7)  as 


If  an  informed  node  is  found  on  the  jth  hop,  then  an  in¬ 
formed  node  was  not  located  on  the  previous  (j  —  1)  hops 
because  a  query  is  not  propagated  further  once  an  informed 
node  is  found.  (Recall  that  we  are  performing  an  any- type 
search;  therefore,  the  search  is  concluded  when  at  least  one 
copy  of  the  desired  information  is  located.)  Consequently, 
the  probability  of  locating  an  informed  node  for  the  first  time 
on  the  jth  hop  is 


Pr{Zi  =  Z2  =  •  •  •  =  Zj- 1  =  0,  Zj  =  1} 


1-1- 


aN 
N  —  1 


8' 


j  =  h 


1-1 


aN 


N  -  1  -  (j  -  1)5' 


(3.5) 
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aN 


N  —  1  —  (i  —  1)8' 


,  J  >2. 


E[Q]  =  n  •  (Exmt  +  8’  •  Ercv)  •  E[Xa^l  (3.9) 

where  n  is  the  total  number  of  unique  queries  generated  by 
n  OQNs  to  locate  a  particular  agent.  Note  that  the  number 
of  informed  nodes,  aN,  is  assumed  to  be  constant  for  all 
n  queries.  Although  the  number  of  informed  nodes  should 
increase  as  queries  are  answered,  we  make  no  temporal  as¬ 
sumptions  regarding  the  generation  of  queries  or  responses. 
Hence,  (3.9)  represents  an  upper  bound  for  the  expected  en¬ 
ergy  expended  by  the  network  to  locate  an  informed  node. 
Additionally,  the  value  of  n  may  be  set  prior  to  deploy¬ 
ment  based  on  analysis  of  the  network’s  application(s),  or 
it  may  be  updated  dynamically  if,  for  example,  one  or  more 
nodes  recognize  the  number  of  unique  requests  for  a  par¬ 
ticular  resource  exceeds  a  specified  threshold.  Alternatively, 
a  feedback-driven  caching  mechanism  can  be  used  (cf., 
Sect.  4.3). 
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3.2.3  Response  transmission/reception 

Once  the  desired  information  is  located,  the  response  is 
returned  to  the  OQN.  Although  we  assume  intermediate 
nodes  in  the  response  path  are  chosen  using  MFR  along 
the  straight-line  trajectory  defined  by  the  current  QN  and 
OQN,  there  are  several  energy-efficient  routing  protocols 
that  could  perform  this  function.  Most  notably,  Span  [5]  and 
GAF  [29]  provide  point-to-point  routing  services  and  are 
specifically  designed  to  reduce  energy  expenditure  by  max¬ 
imizing  the  number  of  nodes  in  the  sleep  state. 

Let  R  denote  the  energy  used  by  the  network  to  return 
a  response  to  the  OQN.  Assuming  the  query  does  not  en¬ 
counter  a  network  boundary  prior  to  locating  an  informed 
node,  the  expected  number  of  transmissions  to  return  the  re¬ 
sponse  is  identical  to  the  expected  number  of  query  trans¬ 
missions  required  to  locate  the  informed  node.  Then  the  ex¬ 
pected  energy  required  to  return  n  responses  to  n  OQNs  is 

E[R]=n-  (Exmt  +  Ercv)  '  E\Xa^].  (3.10) 

3.2.4  Expected  energy  requirement 

The  total  energy  T  required  to  propagate  an  agent,  its  asso¬ 
ciated  query(ies),  and  response(s)  is  the  sum  of  (3.1),  (3.9), 
and  (3.10).  An  additional  transmission  and  reception  must 
be  added  for  each  query  since  an  informed  node,  once  lo¬ 
cated,  must  advise  the  current  QN  the  desired  information 
has  been  found.  Therefore,  the  expected  total  energy  ex¬ 
pended  by  the  network  to  generate  n  unique  responses  is 

E[T]  =  (aN  —  1  +  n)(Exmt  +  Ercv) 

+  (2 nExmt  +  n(8'  +  1  )Ercv)  •  E[Xa  ^].  (3.11) 


3.3  Minimizing  expected  total  energy  expended 

The  main  objective  of  the  TSBQ  protocol  is  to  minimize 
the  expected  total  energy  expended  by  the  network  to  gen¬ 
erate  n  successful  responses  to  n  queries  for  the  desired 
data/service.  Therefore,  whenever  Ercv,  Exmt,  N ,  and  n  are 
known,  we  seek  to  select  the  optimal  pair  (a,  8 r)  that  mini¬ 
mizes  (3.11). 

We  now  formalize  the  problem  and  its  solution  proce¬ 
dure.  To  emphasize  the  explicit  dependence  of  (3.1 1)  on  the 
decision  variables  a  and  8\  let  f(a,  8 r)  =  E[T ]  denote  the 
expected  total  energy  expended  by  the  network.  The  mathe¬ 
matical  programming  formulation  is  as  follows: 

Minimize  f(a,8f) 

Subject  to: 

ae{l/N,2/N,...,(N  -l)/N}, 

8'  e{  1,2, 

For  a  finite  network,  f(a,  8 ')  is  a  discrete  function  on  a 
feasible  region  with  (N  —  1)  •  8  possible  solutions.  There¬ 
fore,  the  mathematical  program  is  a  straightforward  discrete 
optimization  problem  in  which  the  minimum  energy  expen¬ 
diture  may  be  obtained  by  enumerating  all  possible  com¬ 
binations  of  ( a ,  8'),  and  then  choosing  the  (a,  8 ')  pair  that 
results  in  the  least  total  energy  expended.  We  refer  to  the 
pair  of  a  and  8 '  values  that  result  in  the  minimum  expected 
energy  expenditure  as  (a*,  5'*).  A  partial  graph  of  the  ob¬ 
jective  function  for  a  5000-node  network  is  shown  in  Fig.  4. 
Note  that  we  normalize  the  expected  total  energy  expended 
by  the  energy  expended  for  node  transmission;  we  also  as¬ 
sume  0  <  Ercv  <  Exmt.  The  Ercv/Exmt  ratio  is  defined  by  the 
hardware  characteristics  of  the  nodes  and  sizes  of  the  trans¬ 
mitted  packets.  It  can  also  include  the  energy  expended  by 
the  MAC  layer  for  transmissions  and  retransmissions. 


Fig.  4  Plot  of  f(a,8'), 
N  =  5000,  n  =  1, 

Ercv/Exmt  —  0.7 
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Fig.  5  Effect  of  N  and 
Ercvl  EXmt  OH  OL  ,  Tl  =  1 


Fig.  6  Effect  of  N  and 
Eycv!  EXmt  OH  8  ,  fl  =  1 


We  now  examine  the  effect  of  increased  network  size  and 
various  Ercv/Exmt  ratios  on  the  optimal  (a,  8')  pair.  The  re¬ 
sults  of  this  analysis  for  a  wide  range  of  network  sizes  are 
shown  in  Figs.  5  and  6  for  the  single-query  case  (i.e.,  n  =  1), 
and  the  minimum  expected  total  energy  expended  is  shown 
in  Fig.  7.  For  example,  if  Ercv/Exmt  =  0.5,  (a*,  <$'*)  for  a 
50000-node  network  is  (0.00266,59),  and  the  expected  total 
energy  expended  (normalized)  is  419.6. 

3.4  Approximating  the  optimal  solution 

Although  (a*,  5'*)  can  be  obtained  for  a  network  of  fixed 
size,  density,  and  Ercv/Exmt  ratio  via  explicit  enumerations, 
this  method  imposes  a  high  computational  requirement  for 
N  very  large.  In  the  worst  case,  the  optimization  program  re¬ 
quires  0(N )  floating-point  additions,  0(N 2)  floating-point 


multiplications,  and  0(N 2)  floating-point  exponential  op¬ 
erations.  For  extremely  large,  dense,  networks,  it  may  not 
be  feasible  to  carry  out  this  analysis.  Additionally,  the  para¬ 
meters  that  characterize  a  newly  deployed  network  will  al¬ 
most  certainly  change  during  the  network’s  useful  lifetime, 
requiring  the  optimal  solution  to  be  periodically  updated. 
Ideally,  it  would  be  advantageous  to  express  a*  and  8'*  as 
functions  of  N  and  Ercv/Exmt. 

Regression  analysis  of  the  curves  in  Figs.  5  and  6  reveals 
that  the  power  model  provides  an  excellent  fit  to  our  nu¬ 
merical  results,  yielding  correlation  coefficients  greater  than 
0.999.  The  generalized  power  model  is 

A  =  BC(x)p ,  (3.12) 
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Fig.  7  Expected  minimum 
energy  expended  using  (a*, 
<$'*),  #i  =  l 


where  A  is  the  dependent  variable,  C(x)  is  the  independent 
variable,  and  B  and  p  are  constants.  The  following  equa¬ 
tions  are  used  to  determine  a*  and  8'*  as  a  function  of  the 
network  size  N: 

a*  =b\  •  NPI , 

(3.13) 

8'*  =  b2-Np\ 

where  b\,  b2,  p\,  p2  are  constants  for  a  fixed  Ercv/Exmt  ra¬ 
tio. 

The  regression  analysis  reveals  the  following:  first,  the 
value  of  a  resulting  in  the  smallest  total  energy  expenditure 
for  a  fixed  Ercv/Exmt  ratio  is  inversely  proportional  to  the 
square  root  of  A  (i.e.,  p\~—  0.5),  and  b\  increases  as  the 
Ercv/Exmt  ratio  increases.  Hence,  as  network  size  increases, 
the  minimum  expected  energy  expenditure  is  obtained  by 
using  a  smaller  percentage  of  informed  nodes.  This  prop¬ 
erty  has  the  added  benefit  of  reducing  the  percentage  of  to¬ 
tal  network  storage  capacity  required  by  each  unique  agent, 
decreasing  the  probability  that  nodes  will  need  to  employ  an 
event  table  entry  replacement  protocol.  Second,  the  value  of 
8'*  for  a  fixed  Ercv/ Exmt  ratio  is  approximately  proportional 
to  the  fourth  root  of  N  (i.e.,  p2  ~  0.265),  indicating  that  8'* 
increases  at  a  much  slower  rate  than  the  size  of  the  network. 
As  the  Ercv/Exmt  ratio  increases,  b2  decreases,  thus  reflect¬ 
ing  the  increased  cost  of  receiving  a  transmission. 

The  value  of  8 also  defines  the  threshold  one-hop 
neighbor  density  required  to  achieve  the  most  energy- 
efficient  search  performance.  As  the  average  size  of  a 
node’s  neighborhood  increases  beyond  the  values  indicated 
in  Fig.  6,  TSBQ  is  more  efficient  than  local  broadcast  (i.e., 
transmitting  the  query  to  all  of  a  node’s  one-hop  neighbors). 
However,  when  8  is  less  than  8'*/(  1  —  ci),  where  c\  is  the 
average  proportion  of  shared  neighbors  between  each  QN 


and  PQN,  the  query  should  be  broadcast  to  a  node’s  closest 
neighbors  to  reduce  total  energy  expenditure.  In  this  case, 
local  flooding  is  simply  a  special  case  of  TSBQ  in  which  the 
computed  value  of  8 r*  is  greater  than  8(1  —  c\). 

If  8 '  is  decreased  below  the  values  in  Fig.  6,  the  expected 
total  energy  expenditure  increases  due  to  the  larger  number 
of  query  transmissions  required  to  locate  an  informed  node. 
The  unicast  query  model,  in  which  each  query  transmission 
is  intended  for  a  single  receiver,  defines  the  largest  possi¬ 
ble  reduction  in  8\  i.e.,  8'  =  1.  The  expected  total  energy 
expenditure  for  the  unicast  rumor  routing  model,  similar  to 
that  used  in  SLR  [6],  can  be  computed  using  (3.11)  by  sub¬ 
stituting  8'  =  1 .  However,  analysis  of  the  unicast  model  in¬ 
dicates  much  larger  values  of  a  are  required  to  achieve  the 
minimum  energy  expenditure,  and  the  minimum  energy  ex¬ 
penditure  of  the  unicast  model  exceeds  that  of  TSBQ.  For 
example,  in  a  20000-node  network  with  an  Ercv/Exmt  ratio 
of  0.7  and  n  =  1,  the  minimum  E[T]  of  TSBQ  consumes 
50.2%  less  energy  than  the  unicast  query  strategy  (338.7 
versus  680.0  normalized  energy  units).  Additionally,  TSBQ 
requires  only  94  informed  nodes  per  agent  to  achieve  min¬ 
imum  E[T]  versus  199  for  the  unicast  protocol,  a  52.8% 
reduction  in  total  network  storage  capacity  consumed  per 
agent.  For  the  20000-node  network,  Fig.  8  shows  the  min¬ 
imum  total  energy  expended  by  TSBQ  ranges  from  45.5% 
to  75.0%  less  than  trajectory-based  unicast  search  protocols, 
such  as  SLR. 

Additional  analysis  of  the  model  reveals  the  value  of  a* 
increases  by  a  factor  of  approximately  3.4  for  each  order 
of  magnitude  increase  in  n  (Fig.  9),  and  8'*  decreases  by 
a  factor  of  approximately  2.0  for  each  order  of  magnitude 
increase  in  n  (Fig.  10).  This  result  is  consistent  with  in¬ 
tuition:  minimum  E[T]  is  achieved  by  advertising  popular 
data/services  to  a  larger  portion  of  the  network,  thus  per- 
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Fig.  8  Minimum  E[T]  of 
TSBQ  versus  unicast  search 
protocols,  N  =  20000,  n  =  1 


Fig.  9  Effect  of  n  on  a*,  TSBQ 
protocol,  N  =  20000 


mitting  the  energy  costs  related  to  advertising  to  be  amor¬ 
tized  over  a  larger  number  of  queries.  When  a  popular  item 
is  heavily  advertised,  we  expect  to  locate  the  desired  infor¬ 
mation  using  fewer  transmissions.  Accordingly,  8 '  should 
be  decreased  to  achieve  the  minimum  total  energy  expen¬ 
diture  when  an  item  is  popular  and  heavily  advertised,  while 
8'  should  be  increased  to  locate  less  popular  (and,  hence, 
lightly  advertised)  items. 

In  contrast  to  TSBQ,  unicast  search  algorithms  require 
a  higher  proportion  of  informed  nodes — regardless  of  the 
Ercv/Exmt  ratio — to  achieve  minimum  E[T].  As  shown  in 
Fig.  11,  the  value  of  a*  for  the  unicast  search  protocol  is 
unaffected  by  the  Ercv/Exmt  ratio,  and  this  value  always  ex¬ 
ceeds  the  corresponding  a*  value  for  TSBQ.  This  is  due 


to  the  fact  that  unicast  protocols  cannot  take  advantage  of 
efficiencies  gained  by  querying  multiple  nodes  per  trans¬ 
mission. 

4  Simulation  results 

We  have  demonstrated  how  our  mathematical  model  may 
be  used  to  minimize  the  total  expected  energy  expended  to 
locate  services  and  data  within  a  WSN.  However,  as  noted 
in  Sect.  3.2.2,  the  analytic  model  makes  two  simplifying  as¬ 
sumptions.  First,  we  assume  informed  nodes  are  spatially 
uniformly  distributed  throughout  the  network.  Second,  the 
analytic  model  does  not  explicitly  account  for  the  proba¬ 
bility  of  a  query  encountering  a  network  boundary  prior  to 
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Fig.  10  Effect  of  n  on  <$'*, 
TSBQ  protocol,  N  =  20000 


Fig.  11  Effect  of  n  on  a*, 
unicast  search,  N  =  20000 


locating  an  informed  node.  To  examine  the  significance  of 
these  assumptions  on  our  analytic  model,  we  now  compare 
the  predicted  performance  of  TSBQ  to  the  results  of  simu¬ 
lation. 

Section  4.1  explains  the  construction  of  our  network 
simulator.  Section  4.2  examines  the  impact  of  network 
boundaries  on  the  predictive  value  of  our  mathematical 
model,  and  Sect.  4.3  assesses  the  effects  of  trajectory-based 
forwarding — and  the  resulting  non-uniform  distribution  of 
informed  nodes — on  performance.  To  improve  performance, 
a  simple  feedback  mechanism  is  proposed  that  imposes 
negligible  additional  energy  cost.  Section  4.4  evaluates  the 
predicted  and  observed  variance  of  energy  expenditure  per 


query.  Finally,  based  on  our  simulation  results,  Sect.  4.5  pro¬ 
poses  an  improved  mathematical  model  that  incorporates 
network  boundaries. 

4.1  Simulation  construction 

To  accommodate  the  large,  dense  networks  of  nodes  needed 
to  evaluate  the  performance  of  the  TSBQ  protocol,  we  de¬ 
velop  a  network  simulation  using  MATLAB.  Since  our  an¬ 
alytic  model  assumes  a  reliable  channel,  no  collisions,  and 
retransmissions  managed  by  the  MAC  layer  (although  these 
effects  are  indirectly  included  in  the  analytic  model  via 
the  Exmt  and  Ercv  parameters),  a  MATLAB -based  simula- 
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Fig.  12  TSBQ  performance, 
5000-node  network,  8'  =  21, 

ErCv/ Exmt  =  0.7 


tion  was  well-suited  for  our  purposes.  Thus,  we  were  able 
to  obtain  in  a  reasonable  time  1000  replicates  per  set  of 
parameters — and  ensure  the  stability  of  the  simulation  on 
a  standard  desktop  PC. 

The  simulator  generates  networks  of  N  randomly-placed 
nodes  within  the  confines  of  a  user-defined  square  deploy¬ 
ment  region.  To  simplify  the  process  of  determining  the  set 
of  neighbors  of  each  node,  we  assumed  a  circular  radio 
propagation  model  and  specified  the  maximum  transmis¬ 
sion  range  that  results  in  the  minimum  acceptable  Eb/N0 
for  each  node.  Although  our  transmission  model  is  some¬ 
what  unrealistic  for  indoor  environments,  it  has  been  found 
to  be  accurate  for  modeling  outdoor  WSNs  [10].  Regardless, 
TSBQ  does  not  require  an  isotropic  transmission  range  for 
proper  operation. 

The  simulation  follows  the  steps  of  the  TSBQ  protocol 
outlined  in  Sect.  3.  First,  randomly-selected  witness  nodes 
forward  an  agent  to  (a N  —  1)  unique  nodes.  Once  the  agents 
have  informed  the  network,  randomly-selected  uninformed 
nodes  generate  queries.  Prior  to  each  query  transmission,  the 
transmitting  node  selects  a  PQN  and  also  randomly  chooses 
8 '  of  its  closest  one-hop  neighbors  as  receiving  nodes  from 
among  those  nodes  closer  to  the  current  QN  than  either  the 
PQN  or  the  previous  QN.  Although  our  node  transmission 
model  results  in  a  well-defined  region  for  choosing  RNs 
(Fig.  2),  irregularly- shaped  one-hop  neighborhoods  can  be 
accommodated  by  permitting  designated  RNs  to  turn  off 
their  receivers  if  they  determine  they  have  already  received  a 
copy  of  a  particular  query.  Once  an  informed  node  is  found, 
the  response  is  returned  to  the  OQN.  The  mean  total  energy 
expended  to  inform  the  network,  answer  each  query,  and  re¬ 
turn  the  response  is  reported  at  the  completion  of  1000  in¬ 
dependent  trials  for  each  (a,  S')  pair.  Our  simulations  con- 


Table  1  Simulation  model  parameters 


Network  Deployment  Effective  node  Average  one-hop 

size  (N)  area  transmission  neighborhood 

range  size  (5) 


5000  nodes 

30000  m2 

11  m 

63 

10000  nodes 

59395  m2 

11  m 

64 

20000  nodes 

97470  m2 

11  m 

78 

sisted  of  testing  5000-,  10000-,  and  20000-node  networks 
using  the  parameters  summarized  in  Table  1 . 

The  average  run-time  for  each  simulation  varies  based 
on  several  user-defined  parameters,  including  the  number 
of  nodes  in  the  network  and  the  number  of  replications  of 
each  experiment.  However,  using  a  3.2  GHz  Pentium  IV 
computer  and  1000  replicates  per  data  point,  the  results  pre¬ 
sented  in  the  next  subsection  required  approximately  6  hours 
for  the  5000-node  network,  17  hours  for  the  10000-node  net¬ 
work,  and  56  hours  for  the  20000-node  network. 

4.2  Effect  of  network  boundaries  on  performance 

The  mathematical  model  of  the  expected  energy  require¬ 
ment  assumes  a  uniform  distribution  of  informed  nodes. 
Therefore,  to  study  the  effect  of  network  boundaries  on  the 
performance  of  our  protocol,  the  simulation  was  permitted 
to  randomly  choose  aN  informed  nodes,  thus  permitting  an 
assessment  of  the  performance  of  TSBQ  free  of  the  effects 
of  the  agent  routing  method.  (The  impact  of  trajectory  rout¬ 
ing  on  system  performance  is  evaluated  in  Sect.  4.3.) 

The  results  of  these  simulations  for  5000-,  10000-,  and 
20000-node  networks  are  shown  in  Figs.  12,  13,  and  14, 
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Fig.  13  TSBQ  performance, 
10000-node  network,  8'  =  32, 

Ercv/ EXmt  —  0-2 


Fig.  14  TSBQ  performance, 
20000-node  network,  8'  =  39, 

Ercv/ Exmt  —  0.7 
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respectively.  Each  data  point  represents  the  average  perfor¬ 
mance  of  1000  independent  simulation  runs.  With  the  ex¬ 
ception  of  the  smallest  values  of  a  (e.g.,  a  <  0.004  for  the 
5000-node  case),  the  value  of  E[T]  predicted  by  (3.1 1)  was 
within  the  95%  confidence  interval  of  the  simulation  re¬ 
sults.  The  observed  results  at  lower  values  of  a  differ  from 
the  mathematical  model  due  to  a  large  number  of  queries 
dropped  by  the  network  at  a  boundary  prior  to  discovering 
an  informed  node.  When  this  event  occurred  in  our  simula¬ 
tions,  the  OQN  was  forced  to  reissue  the  query  along  another 
randomly-chosen  trajectory  after  an  appropriate  timeout  pe¬ 
riod.  Since  we  did  not  limit  the  OQN’s  choice  of  trajectories 
for  reissued  queries  in  the  simulation  model,  a  node  may 
receive  the  same  query  more  than  once  if  subsequent  trajec¬ 


tories  are  similar  to  the  original.  We  note  that  TSBQ  is  de¬ 
signed  to  prevent  nodes  from  receiving  transmissions  of  the 
same  query  on  subsequent  hops.  It  does  not  attempt  to  pre¬ 
vent  nodes  from  being  queried  more  than  once  by  reissued 
queries.  However,  further  energy  savings  can  be  obtained  by 
allowing  nodes  to  turn  off  their  receivers  once  they  deter¬ 
mine  a  given  query  has  already  been  received. 

Based  on  these  results,  we  conclude  that  the  mathemat¬ 
ical  model  is  useful  for  predicting  the  performance  of  the 
network  if  the  actual  proportion  of  informed  nodes  is  not 
significantly  smaller  than  a*.  However,  the  predictive  capa¬ 
bility  of  the  model  can  be  improved  at  small  values  of  a  by 
extending  (3.11)  to  include  parameters  associated  with  the 
network  deployment  area  and  the  transmission  range  of  the 
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nodes.  Section  4.5  explains  how  to  extend  the  mathematical 
model  in  this  manner. 


4.3  Effect  of  trajectory-based  forwarding  of  agents 


Although  our  mathematical  model  assumes  a  spatially  uni¬ 
form  distribution  of  informed  nodes,  such  a  distribution  of 
informed  nodes  is  difficult  to  achieve  in  real-world  networks 
due  to  the  limited  transmission  range  of  nodes.  A  uniform 
distribution  of  informed  nodes  might  be  attained  by  artifi¬ 
cially  partitioning  the  network  into  equal- size  zones — such 
as  those  used  in  Zonal  Rumor  Routing  [1] — or  by  guaran¬ 
teeing  at  least  k-hop  distance  between  identical  event  table 
entries  using  a  method  such  as  k-DID  [2],  but  such  schemes 
require  additional  energy  expenditure  and  increase  complex¬ 
ity.  Also,  algorithms  such  as  k-DID  have  been  found  to  scale 
poorly  in  dense  networks  [2].  Instead,  we  propose  to  route 
agents  along  randomly-chosen  straight-line  trajectories  and 
use  MFR  to  choose  intermediate  receivers  to  achieve  max¬ 
imum  initial  spatial  dispersion  of  informed  nodes  in  the 
fewest  possible  transmissions.  As  a  consequence,  we  expect 
mean  per-query  energy  expenditure  to  differ  from  that  pre¬ 
dicted  by  the  mathematical  model,  especially  at  lower  values 
of  a,  due  to  a  spatially  non-uniform  distribution  of  informed 
nodes  and  queries  encountering  a  network  boundary  prior  to 
locating  an  informed  node. 

To  examine  the  effects  of  straight-line  forwarding  of 
agents  on  overall  energy  expenditure,  additional  simulation 
experiments  were  conducted  using  the  parameters  in  Ta¬ 
ble  1.  The  results  of  these  simulations  are  shown  in  Figs.  15, 
16,  and  17.  Each  data  point  represents  the  average  perfor¬ 
mance  observed  over  1000  independent  simulation  runs. 

As  predicted,  informing  nodes  via  trajectory-based  for¬ 
warding  results  in  differences  between  the  predicted  and  ob¬ 
served  mean  per-query  energy  expenditures;  however,  the 
general  trend  of  the  results  follows  that  predicted  by  (3. 1 1)  at 
higher  values  of  a .  For  this  reason,  we  advocate  the  use  of  a 
feedback-driven  caching  mechanism  to  increase  the  number 
of  informed  nodes  at  little  or  no  energy  cost  to  the  network. 
The  purpose  of  this  mechanism  is  to  decrease  the  energy  ex¬ 
pended  by  the  network  to  answer  future  queries;  it  is  also 
useful  if  the  magnitude  of  n  is  unknown  during  the  network 
design  phase. 

This  feedback-driven  caching  mechanism  is  described  as 
follows:  once  a  QN  locates  an  informed  node,  the  actual  total 
number  of  query  transmissions  required,  xa  ^,  is  compared 
to  the  number  of  query  transmissions  expected,  E[Xa  ^]. 
Assuming  the  OQN  becomes  an  informed  node  upon  receiv¬ 
ing  the  response,  a  value  p ,  0  <  p  <  1 ,  is  computed  by 


p  =  max 


Xg,8'  ~  2E[Xajr]  1 

Xa,8'  ’  E\Xa,§']  j 


(4.1) 


Intermediate  nodes  at  each  hop  in  the  response’s  path  add 
the  information  contained  in  the  response  to  their  own  event 


tables  with  probability  p.  Although  not  shown  here,  our 
experiments  indicate  this  feedback  mechanism  provides  a 
significant  decrease  in  total  energy  expenditure  for  subse¬ 
quent  queries  at  the  expense  of  total  available  network  stor¬ 
age  capacity.  Alternatively,  nodes  recognizing  a  higher-than- 
expected  number  of  queries  for  a  particular  agent  might  also 
forward  the  high-demand  agent  autonomously  to  inform  a 
larger  portion  of  the  network,  thereby  increasing  the  proba¬ 
bility  that  additional  nodes  are  capable  of  answering  a  query. 
Additional  energy  savings  may  also  be  realized  by  aggregat¬ 
ing  updates. 

4.4  Performance  variance 


The  mathematical  model  and  our  simulation  results  indi¬ 
cate  the  variance  in  the  total  energy  consumed  to  generate 
a  response  can  be  large,  especially  at  smaller  values  of  a 
and  S'.  Although  we  found  no  mention  of  a  variance  analy¬ 
sis  of  total  energy  expenditure  in  the  literature,  our  results 
can  be  generalized  to  any  rumor  routing-based  search  algo¬ 
rithm.  However,  as  shown  in  Fig.  18,  the  variance  of  total 
energy  expended  (and,  hence,  the  number  of  transmissions 
and/or  latency  required  to  answer  a  query)  is  inversely  pro¬ 
portional  to  a.  Therefore,  if  an  application  requires  a  query 
to  be  answered  within  a  specific  number  of  transmissions 
(or,  alternatively,  specifies  a  maximum  latency)  with  a  given 
probability,  the  requirement  can  be  met  by  adjusting  a  ap¬ 
propriately.  The  cost  of  increasing  a,  however,  is  an  increase 
in  mean  per-query  energy  consumption  and  a  decrease  in  the 
total  effective  storage  capacity  of  the  network.  The  predicted 
variance  based  on  the  choice  of  a  can  be  determined  by  eval¬ 
uation  of  the  following: 


k 

Var[X„,a/]  =  ^72.Pr{X«ia/  =  y} 
j= i 

k 

-  Pr{*M'=7} 

-;= i 


(4.2) 


In  Fig.  18,  the  observed  variance  of  T  in  our  simulations 
is  generally  higher  than  that  predicted  by  (4.2)  at  lower  a  be¬ 
cause  a  query  is  dropped  if  it  attempts  to  travel  beyond  the 
defined  network  boundaries.  When  a  response  fails  to  arrive 
after  the  expiration  of  a  timeout  period,  the  OQN  may  reis¬ 
sue  the  query  along  new  randomly-chosen  trajectories  until 
a  response  is  received.  (This  is  the  approach  we  take  in  our 
simulations.)  However,  if  a  node  chooses  random  trajecto¬ 
ries  for  reissued  queries  that  result  in  similar  paths  through 
the  network,  redundant  querying  of  nodes  can  result.  Thus, 
it  may  be  prudent  to  limit  a  node’s  range  of  available  tra¬ 
jectories  in  the  event  that  it  must  reissue  a  query.  Addition¬ 
ally,  the  predictive  value  of  the  model  could  be  improved 
by  incorporating  the  probability  of  a  query  encountering  a 
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Fig.  15  TSBQ  with  agent 
trajectory  routing,  5000-node 
network,  8'  =  21, 

Ercv/ EXmt  —  0-2 


Fig.  16  TSBQ  with  agent 
trajectory  routing,  10000-node 
network,  8'  =  32, 

Ercv/  EXmt  —  0.7 
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network  boundary.  We  discuss  this  improvement  in  the  next 
subsection. 

4.5  Network  boundaries  and  the  analytic  model 

The  mathematical  model  (3.11)  can  be  improved  by  ac¬ 
counting  for  the  effect  of  a  query  encountering  a  net¬ 
work  boundary  prior  to  locating  an  informed  node.  This 
requires  determining  the  mean  hop-distance  between  a 
randomly-chosen  node  and  a  random  point  located  on  the 
network  boundary.  If  d  is  the  straight-line  distance  between 
a  randomly-chosen  node  and  a  random  point  on  the  network 
boundary,  the  expected  number  of  hops,  ft,  before  a  query 
encounters  a  boundary  is 

^  Springer 


(4.3) 


where  R'  is  the  mean  distance  between  transmitter-receiver 
pairs.  Assuming  a  network  of  sufficient  density,  R ’  is  ap¬ 
proximately  equal  to  the  node  transmission  range  R  when 
using  MFR  routing.  The  value  of  d  can  be  determined  math¬ 
ematically  or  via  Monte  Carlo  experiments.  For  example,  in 
a  square  w  x  w  deployment  region  such  as  those  used  in 
our  simulations,  d  is  approximately  0.65 u;.  A  query  that  en¬ 
counters  a  boundary  is  expected  to  have  checked  /3  •  8'  nodes 
unsuccessfully.  Therefore,  the  probability  of  an  OQN’s  orig¬ 
inal  query  encountering  a  network  boundary  prior  to  locat¬ 
ing  an  informed  node  is 
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Fig.  17  TSBQ  with  agent 
trajectory  routing,  20000-node 
network,  8r  =  39, 

Ercv/ EXmt  —  0.7 


Fig.  18  Predicted  vs.  observed 
variance  of  T,  N  =  20000, 
n  =  1,8' =  39 


a 


MXa,8'  >P)  = 


aN 
N-  1 


P-&' 


(4.4) 


If  the  OQN  is  permitted  to  reissue  failed  queries  using 
an  unrestricted  range  of  trajectories,  the  expected  number  of 
query  attempts,  n\  to  locate  an  informed  node  is 


/ 

n  = 


aN  yA”1 


(4.5) 


Because  the  OQN’s  choice  of  trajectories  is  not  restricted 
in  these  experiments,  there  is  a  non-zero  probability  of  over¬ 
lap  in  the  regions  of  subsequent  query  transmissions.  There¬ 
fore,  we  introduce  a  term,  f ,  to  account  for  the  energy  ex¬ 


pended  due  to  nodes  being  polled  more  than  once  in  the 
event  a  query  is  reissued.  The  value  of  f  is  a  function  of  both 
the  density  and  transmission  range  of  the  nodes,  and  f  >  1 . 
Using  a  least  mean  squares  analysis,  the  value  of  f  for  our 
20000-node  network  simulations  is  approximately  1.438,  in¬ 
dicating  43.8%  of  the  nodes  polled  by  all  reissued  queries  re¬ 
ceived  the  query  transmission  more  than  once.  Fortunately, 
the  additional  energy  expenditure  due  to  repeated  polling 
of  nodes  is  only  significant  at  small  values  of  a .  At  higher 
a,  n'  ~  1;  hence  f  has  little  effect.  For  example,  using  the 
value  of  o'*  shown  in  Fig.  5  for  the  20000-node  network, 
n '  ~  1.0314;  thus,  only  3%  of  original  queries  fail  to  locate 
an  informed  node.  The  revised  model  for  the  expected  total 
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Fig.  19  Revised  TSBQ 
performance,  20000-node 
network,  n  =  1,  8'  =  39 


energy  expenditure  is 

E\T\  =  (otN  —  1  +  n)(Exmt  +  Ercv) 

T"  (f  ■  n  •  (n  —  1)  •  f5)Exmt 

+  (? -n-(nf -l).p-8')Ercv 

+  [2n£xmr  +  (n  ■  (S'  +  1  ))Ercv]E[X^s,],  (4.6) 

where  s,  is  the  expected  number  of  hops  required  to  lo- 
cate  an  informed  node  when  network  boundaries  limit  the 
maximum  distance  each  query  may  traverse,  and 

£ 

E[X^s,l  =  J2j  -MXa, S'  =j}- 

j= 1 

As  seen  in  Fig.  19,  (4.6)  provides  a  better  prediction  of 
the  total  energy  expended  by  the  network  at  small  a  than 
(3.11).  However,  (3.11)  still  provides  an  accurate  means  to 
estimate  the  values  of  a*  and  <$'*  that  result  in  the  least  total 
energy  expended  without  the  need  to  determine  f . 

5  Conclusions  and  future  work 

We  have  developed  a  mathematical  model  of  a  new  search 
protocol,  TSBQ,  which  is  used  to  minimize  the  total  energy 
expended  to  advertise  services/data  and  respond  to  queries 
in  large-scale,  high-density  WSNs.  This  protocol  is,  to  the 
best  of  our  knowledge,  the  first  search  protocol  to  take  ad¬ 
vantage  of  the  energy  efficiency  of  broadcast  transmissions. 
We  developed  a  mathematical  model  that  predicts  the  ex¬ 
pected  total  energy  expenditure  of  TSBQ  and  optimized  the 
model’s  parameters  for  minimum  energy  expenditure.  This 


model  enables  the  network  designer  to  consider  the  effects 
of  node  density,  memory  capacity,  data/service  popularity, 
and  latency  on  the  total  energy  expended  to  answer  a  query. 
Finally,  we  analyzed  the  performance  variance  of  TSBQ 
and  provided  a  feedback-driven  caching  mechanism  that  im¬ 
proves  search  performance  at  negligible  additional  energy 
cost  to  the  network. 

The  mathematical  model  of  total  energy  expenditure  can 
be  extended  to  encompass  more  general  search  protocols 
and  network  application  requirements.  For  example,  if  a 
node  needs  frequent  access  to  a  particular  service,  the  most 
energy  efficient  strategy  is  to  locate  the  service  in  close  prox¬ 
imity  to  the  node.  The  model  can  be  modified  accordingly, 
thereby  increasing  the  probability  of  locating  the  service  at 
a  nearby  node.  Additionally,  if  improved  agent  dissemina¬ 
tion  algorithms  are  developed  (i.e.,  methods  that  result  in  a 
more  uniform  initial  distribution  of  informed  nodes),  these 
algorithms  can  be  incorporated  into  our  model.  Finally,  the 
mathematical  model  can  be  easily  modified  to  evaluate  the 
optimum  transmission  range  for  networks  of  nodes  that  have 
the  capability  to  vary  transmission  power. 

Mobile  and  three-dimensional  networks  will  become 
more  prevalent  and  grow  in  importance  as  wireless  sensor 
technology  advances.  However,  to  date  there  is  little  discus¬ 
sion  of  search  algorithm  performance  in  three-dimensional 
wireless  sensor  networks.  We  plan  to  study  the  performance 
of  various  algorithms,  including  rumor  routing,  in  such  de¬ 
ployment  spaces.  Furthermore,  we  see  merit  in  extending 
our  study  of  search  algorithm  performance  to  mobile  envi¬ 
ronments.  Finally,  we  are  interested  in  the  effects  of  finite 
agent/query  lifetimes  on  optimal  replication  levels. 
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